
Yuan Yao

Department of Mathematics, Hong Kong University of Science and Technology

MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection

Apr 30, 2025

Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation

Apr 21, 2025

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Apr 21, 2025

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Apr 08, 2025

Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians

Apr 01, 2025

GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior

Mar 14, 2025

Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning

Feb 19, 2025

Noise May Contain Transferable Knowledge: Understanding Semi-supervised Heterogeneous Domain Adaptation from an Empirical Perspective

Feb 19, 2025

UniMatch: Universal Matching from Atom to Task for Few-Shot Drug Discovery

Feb 18, 2025

Pushing the Boundaries of State Space Models for Image and Video Generation

Feb 03, 2025